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ABSTRACT 

Aims. Extending the knowledge about the properties of solar cycles into the past is essential for understanding the solar dynamo. This 
paper aims at estimating areas of sunspots observed by Schwabe in 1825-1867 and at calculating the tilt angles of sunspot groups. 
Methods. The sunspot sizes in Schwabe’s drawings are not to scale and need to be converted into physical sunspot areas. We employed 
a statistical approach assuming that the area distribution of sunspots was the same in the 19th century as it was in the 20th century. 
Results. Umbral areas for about 130,000 sunspots observed by Schwabe were obtained, as well as the tilt angles of sunspot groups 
assuming them to be bipolar. There is, of course, no polarity information in the observations. The annually averaged sunspot areas 
correlate reasonably with sunspot number. We derived an average tilt angle by attempting to exclude unipolar groups with a minimum 
separation of the two alleged polarities and an outlier rejection method which follows the evolution of each group and detects the 
moment it turns unipolar at its decay. As a result, the tilt angles, although displaying considerable scatter, place the leading polarity 
on average 5?85 ± 0?25 closer to the equator, in good agreement with tilt angles obtained from 20th-century data sets. Sources of 
uncertainties in the tilt angle determination are discussed and need to be addressed whenever different data sets are combined. The 
sunspot area and tilt angle data are provided online. 
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1. Introduction 

Solar activity is apparently driven by internal magnetic fields, 
which are roughly oscillatory in time. Sunspots are the most ob¬ 
vious manifestat ions of solar acti vity in visible light, and it was 
Samuel Heinrich lSchwabel (Il844t) who first published a paper on 
the abundance of sunspots as a cyclic phenomenon. 

Apart from the number of sunspots and the various indices 
that can be defined from their appearance, there are other prop¬ 
erties that contain information on the underlying process of gen¬ 
erating variable magnetic fields in the solar interior. The most 
prominent feature is the distribution of sp ots in latitude versus 

]ll863ll. The latitudes of the 


time (butterfly diagram; [ 
spots give us an idea of the location of the underlying magnetic 
fields. In the majority of attempts to explain the dynamo pro¬ 
cess of the Sun, it is assumed that it is strong azimuthal magnetic 
fiel ds which emerge as sunspots at the solar surface (for a review, 
see lCharbonneaul2010l) . These internal horizontal (i.e. parallel to 
the solar surface) fields become locally unstable and form loops 
eventually penetrating the surface of the Sun. At this stage, two 
polarities are formed, which are actually measur ed and ofte n ac¬ 
companied by sunspot groups (for a review, see lFanll2004l) . Al¬ 
ternatively, sunspots may form as a consequence of a large-scale 
magnetic field suppressing the convective motions and thereby 
reducing the turbulent pressure. The lower pressure at the field 
location compresses the flux even further leadin g to further tur¬ 
bulence suppression, and an instability can occur (IKleeorin et al.l 
ll989HWarnecke et al.ll2013l) . 

The radial magnetic field within active regions provides 
poloidal fields to the dynamo system. The production of poloidal 


magnetic flux is an essential ingredient to the sustainability of 
the Sun’s large-scale magnetic field. The angle the group polar¬ 
ities form with t he solar equator is called the tilt angle and was 
first measured bv iHale et al.l ( |1919t) . On average, the leading po¬ 
larity of the group is slightly closer to the solar equator than the 
following one. The dependence of the average tilt angle on the 
emergence latitude of the sunspot groups is often referred to as 
Joy’s law, according to the paper mentioned above. 

Tilt an gles from sunsp ots in white-light images were com- 
puted by Howard (Il991l) from Mt. Wilson images and by 
ISivaraman et al.l (Il999l) from Kodaikanal i mages. From those 
data, averag e tilt angles were obtained by iDasi-Esnuig et al.l 
(l201d.l20r^ for the solar cycles 15-21. An anti-correlation be¬ 
tween the average tilt angle and the amplitude of the correspond¬ 
ing cycle was found. Additionally, the product of this average 
and the cycle amplitude correlates significantly with the strength 
of the next cycle. We will come back to more recent tilt angle 
determinations in Sects. I43] andl5] 

Ba sed on magnetograms from Kitt Peak, IWang & SheelevI 
(Il989l) determined tilt angles for cycle 21 and obtained 
a large average va lu e of 10°, a result confirmed by 
Stenflo & KosovichevI (1201 2l) from MDI data. Recently, 


Wang et al.l ( 20151) compared tilt angles from white-light im¬ 


ages of the Debrecen Photoheliographic Database and from 
Mt. Wilson magnetograms for cycles 21-23. They found that 
magnetogram tilt angles tend to be larger than the ones from 
sunspot groups in white-light images, both because a substantial 
fraction of the white-light tilt angles refer to sunspots of the 
same polarity, and because the magnetograms include magnetic 
flux from plage regions typically showing higher tilts than the 
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Fig. 1. Example of the drawing style in the main period of 1831-1867. 
This full-disk drawing of 1847 July 22 shows spots with and without 
penumhrae. The drawing also shows two group designations (116 and 
117) which actually refer to two individual groups each (see Sect.O 
also for the treatment of more difficult cases such as group 113). 

sunspots of the same active region. We will address the first 
issue later in this Paper. 

This Paper is ba sed on the digitized observations by Samuel 
Heinrich Schwabe (lArldl20ll1) of cycles 7-10 and extends the 
subsequent measurements of all positions and estim ates of the 
sizes of the sunspots drawn in these manuscripts (lArlt et alJ 
l2013h . The initial sizes were in arbitrary units corresponding to 
pixel areas in the digital images and may not be to scale. We 
will describe the method of converting these sunspot size esti¬ 
mates into physical areas in Sect.|2] an attempt at defining proper 
sunspot groups in Sect. |3] the computations of the tilt angles in 
Sect. |4]and will summarize the results in Sect.|5] 

2. Calibration of sunspot areas 

Apart from the importance to have reliable sunspot area infor¬ 
mation for the Schwabe period, we also need to know the indi¬ 
vidual sunspot areas for reasonable estimates of the two polari¬ 
ties and their locations in sunspot groups when determining the 
tilt angles of sunspot groups. The sun spot areas may be seen 
as proxies for the magnetic fl ux (e.g. [Houtgast & van SluitersI 
Il948t [^ngnes & JensenllT960i for early studies), although the 
relation of the two may be complex, as emphasized recently by 
iTlatov & Pevtsovl(l201^ . 

Schwabe plotted the sunspots into relatively small circles of 
about 5 cm diameter, representing the solar disk. Given the finite 
width of a pencil tip, at least small spots must have been plotted 
with an area larger than a corresponding structure on the Sun 
would have at that scale. Pores, if plotted to scale, would need to 
have diameters of 0.04-0.1 mm in such a drawing. The umbral 
areas measured in the drawings therefore need to be converted 
into physical areas on the Sun. 

There are two ways of obtaining physical sizes of the 
sunspots drawn by Schwabe. The one requires the existence of 



Fig. 2. Example of the drawing style in the initial period of 1825-1830. 
This full-disk drawing of 1827 June 13 shows large spots which com¬ 
bine several umbrae and at least part of the penumbral area, as is re¬ 
vealed by the detailed drawings. 

Class 

1 2 3 4 5 6 7 8 9 10 11 12 

• ■oooooooOOO 

Area in pixels 

5 9 21 37 69 97 145 185 221 270 308 364 

Fig. 3. Twelve cursor shapes (size classes) used for the size estimates 
of the sunspots drawn by Schwabe. The bottom line gives the area in 
square pixels on the screen, including the black border pixels. 

high-resolution drawings by other observers within the observ¬ 
ing period of Schwabe for calibration. The other is a statisti¬ 
cal approach using data sets of the 20th century to calibrate the 
sizes. We will first describe the latter method, since the num¬ 
ber of high-resolution drawings by other observers that can be 
employed for the first method is very limited. The statistical ap¬ 
proach also required a splitting of the data into two sets: 1825- 
1830 and 1831-1867. This is due to a change in the drawing style 
after 1831 Jan 1, as demonstrated in Figs.[T]and|3 In the initial 
period, Schwabe plotted spots without distinction of umbrae and 
penumhrae. In the second period starting in 1831, Schwabe dis¬ 
tinguished umbrae from penumhrae. In those cases, umbral sizes 
were measured. We begin with the second period when areas are 
clearly umbral and describe afterwards a work-around for the 
conversion of sunspot sizes to umbral areas in the initial period 
until 1830. 


2.1. Indirect umbral areas for 1831-1867 

One approach to obtaining physical areas of the sunspot sizes 
is of a statistical nature. Sunspots were divided into 12 classes 
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by size, as introduced by lArlt et all (l2013h . The measurements 
were actually catTied out with twelve different circular cursor 
shapes having areas from 5 to 364 square pixels. Their shapes 
and screen areas are shown in Fig. [3 The classes increase 
monotonously in area, but were chosen relatively arbitrarily. Any 
more precise individual pixel-counts of umbral areas are unlikely 
to yield more accurate data, since the drawings are meant to be 
sketches of the sunspot distributi ons and sizes rat her than exact 
copies. More details are given bv lArlt et al.l (l2013h . 

The division into 12 size classes is done both for Schwabe’s 
dataset and for a modern data set (see below), whereby for the 
modern dataset the classes are chosen such that each class has 
the same relative number of sunspots as the corresponding class 
in Schwabe’s data. In other words, the relative abundances of the 
twelve size classes of the Schwabe spots are compared and cali¬ 
brated with 20th century data by building histograms of 12 artifi¬ 
cial size classes constructed to contain the identical abundances. 
The average umbral area in such an artificial class gives us the 
umbral area corresponding to a Schwabe size class. Finally, a 
function for the area depending on the heliocentric angle of a 
spot from the centre of the solar disk is fitted for each size class. 
We will refer to this angle as ‘disk-centre distance’ 6 in the fol¬ 
lowing. 

The reference data sets used to obtain a statistical mapping of 
sunspot umbral areas are from photoheliographic data of the ob¬ 
servatories of Debrecen, Mt. Wilson, Ko daikan a l, and the MDl 
instrument of SOHO. As described by iGvoril (Il998h . an im¬ 
proved automatic analysis method was used for the Debrecen 
data starting in 1988. Before that, from 1974 to 1987, the ar¬ 
eas were measured by video techniques (iDezso et al.lll987() . In 
the 1974-1987 data, the area values of larger sunspots at disk- 
centre distances d > 60° increase very rapidly with 6. This effect 
is not seen in the area values measured from 1988 onward, so 
we have used the Debrecen da ta from 1988-2013 o nly. The Mt. 
Wilson data were analysed by [Howard et al.l (1 19841) and contain 
spot properties from 1917-1985. The Kodaikanal data covering 
1906-1987 w ere obtained in almos t the same way as the Mt. 
Wilson ones (ISivaraman et al.iri993h . The MDI data for 1996- 
2010 were obtained using a telescope of 4 arc seconds resolution 
d Watson et al.ll201 Tl) which is similar to or perhaps a bit worse 
than the resolving capabilities of Schwabe’s setup. Note that the 
data from the Greenwich Photoheliographic Database were not 
used because it contains group area totals instead of areas of in¬ 
dividual spots. 

Typical diameters o f solar pores in white light are be tween 
1000 km and 4000 km ([Kennens & Martinez Pillet|ll996l) . That 
converts to 0.8-13 ■ 10^ km^ or 0.26—4.1 millionths of the solar 
hemisphere (MSH). We are using a lower limit of 1 MSH for the 
construction of the histogram as argued below. In the Debrecen 
data, integer values of the projected area in millionths of the so¬ 
lar disk are given (0,1,2,...), while the corrected areas are given 
in MSH, also as integer values. Since the projection correction 
increases the area, whereas the conversion to MSH reduces the 
value by half, the lower limits of 1 in both quantities are there¬ 
fore statistically compatible. We discarded all spots smaller than 
1 MSH from the other data sets before the analysis. Ideally, one 
would want to define a minimum spot size plotted by Schwabe, 
but in reality, his drawing style was not that straight-forward. 
Whenever he detected a small group on the Sun, he indicated its 
location by small dots. In more complex groups, however, he did 
not indicate all the small spots because of their abundance. There 
is apparently no clear lower limit for the spot size. We therefore 
use a compromise at this point, excluding the smallest pores, and 


start from 1 MSH which is also the lower limit in the Debrecen 
data. 

The relative abundances of the twelve cursor size classes, 
denoted by i = 1,2,..., 12, are determined for three different 
ranges of disk-centre distances, which were 6 < 30°, 30°-60°, 
and 60°-70°. These distance classes are numbered as = 1,2,3. 
The upper limit of 70° is due to the fact that not all reference 
data sets contain spots beyond that distance. The four reference 
data sets (this number will be denoted by N in the following) are 
now also divided into twelve classes fulfilling the same relative 
abundances as obtained for the Schwabe classes, again split into 
the three selected ranges of distances. The histograms are based 
on the umbral areas which are corrected for foreshortening. We 
therefore expect a mapping of size classes with a fairly small 
dependence on 6. 

Then the area for each cursor size is calculated by the un¬ 
weighted average of all spots 

_ N S„ij I N 

Aid - EE Anjid E S„id , (1) 

«=1 j=\ ' n=\ 

where Aid is the area for a cursor of i-th size class in the d-th 
disk-centre distance class, is the umbral area (corrected for 
projection by cos ' 6) of the y'-th spot in the n-th data source, i-th 
size class and d-t\\ distance class, and Snid is the total number of 
spots present in n-th data source, i-th size class in d-th distance 
class. Note that these averages consist of different histogram bins 
for different n. For example, the equivalent class-5 bin in the De¬ 
brecen data will have other area limits than the equivalent class-5 
bin of the MDI data. The averaging helps smooth possible sys¬ 
tematic over- or underestimations of areas in the 20th-century 
data sources. The averages are not immediately areas cor¬ 
rected for foreshortening, since the histogram classes are con¬ 
structed using Schwabe’s raw sunspot sizes. We will capture any 
possibly remaining disk-centre distance dependence in Sect. 12.41 
where functions through the three distance classes for each i will 
be derived, i.e. 12 functions for the 12 size classes. 

2.2. Indirect umbral areas for 1825-1830 

The sunspots in the early full-disk drawings from 1825 to 1830 
were not drawn at a good resolution. Instead, Schwabe plotted 
high-resolution magnifications at unknown scale beside the full- 
disk drawings. The magnifications show that nearby spots were 
combined in the full-disk drawings. The spots in these drawings 
are therefore often ‘blobs’ made out of very close spots and in¬ 
cluding the penumbrae between those spots. Hence, the pencil 
dots used to measure the sizes of those spots do not represent 
their umbral area. To estimate the area for these spots, we need to 
compare the size statistics with grouped spots including penum¬ 
brae. To do that, we combined the spots inside a single penumbra 
in a modern data set and used these for the statistical estimation 
of the area. 

Among the data used here, the Debrecen data is the only 
source which contains umbral and penumbral area s broken down 
into individual spots. Recentlv. lTlatov et al.l(l2014l) published de¬ 
tailed measurements of the Kislovodsk Mountain Astronomical 
Station. Since those only cover the somewhat peculiar cycle 24, 
we prefer not to use them for the area calibration of Schwabe’s 
drawings. The conversion of the 1825-1830 data is therefore 
based on the Debrecen data (1988-2013) as a mixture of differ¬ 
ent cycles. From that source, we prepared a data set in which all 
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the umbrae inside a continuous penumbra are added and consid¬ 
ered as a single spot of which we store the whole-spot area and 
the total umbral area in each spot. Now we divide the whole- 
spot areas into 12 classes with the same relative abundances as 
we obtained from Schwabe’s 1825-1830 data, but use the cor¬ 
responding umbral areas for an average according to Eq. ([1]) for 
each size class in each distance class. The results for the distance 
classes are again combined into a function of the disk-centre dis¬ 
tance in Sect. 12.41 

There are still differences between these combined spots 
from the Debrecen data and the pre-1831 Schwabe spots. (1) A 
penumbra with a single umbra would look similar to a penumbra 
with two umbrae in Schwabe’s drawings. The penumbral area 
between the two umbrae in the latter case, however, will lead 
to different total umbral areas when derived from the Debrecen 
data. (2) It is not always the entire penumbra which Schwabe 
plotted in a ‘blob’. (3) All the umbrae inside a penumbra are 
added in the Debrecen data, even for very extended penumbrae. 
Schwabe, however, did not club together all the umbrae inside 
a connected penumbra when it was very large, but plotted ‘sub- 
penumbrae’ in that case. 

There are 38 spots apparently drawn without the inclusion 
of a penumbra which were found from the visual comparison of 
disk drawings with magnifications. These spots were not used in 
the procedure described here; their areas were calculated using 
the method discussed in Sect. 12.11 

2.3. Umbral areas from parallel observations 

For a short period from 1850 Sep 19 to Nov 4, large-scale draw¬ 
ings are also available from Sestini who observed from Wash¬ 
ingto n, D.C., comprising a total of 42 full-disk graphs (ISestinil 
Il853h . We compiled spots seen by both Schwabe and Sestini 
and measured the umbral areas of the corresponding spots in 
Sestini’s drawings. There were not enough observations to cover 
the whole range of size classes - we could only cover classes 
1-6. We will come back to the results in the following Section. 
One has to note though that the time difference between the ob¬ 
servations from Dessau and the ones from Washington implies 
changes in the evolution of the spots either leading to wrong ar¬ 
eas or to wrong spot associations between the two observers. 

2.4. Final mapping of sunspot sizes 

Figure |4] shows the mappings of Schwabe size classes to phys¬ 
ical areas in km^ and MSH for three different ranges of disk- 
centre distances for the statistical conversion as well as a map¬ 
ping for the calibration with concurrent high-resolution obser¬ 
vations. One immediate result is that the direct conversion of 
the pencil spots in Schwabe’s drawings into sunspot areas would 
lead to overestimated umbral areas in most size classes. One can 
also see that the areas corresponding to the pixel areas do not 
form a linear function or power law. The areas from the compar¬ 
ison with the sunspots observed by Sestini in 1850 are in good 
agreement with the indirect mapping based on the size distribu¬ 
tion. This shows that the direct conversion of pixel areas into 
sunspot areas is not a good choice. The only exception is class 5, 
but it contains only 21 measurements and may be a poor estimate 
(as is the one for class 6). 

The lower panel of Fig. |4] appears to show much smaller ar¬ 
eas. This is not true, however. The graph actually shows that a 
5-MSH spot which was typically drawn as a class-3 dot in 1831- 
1867, was represented by a class-5 dot before that period, since 



Schwabe square pixels 



Schwabe square pixels 


Fig. 4. Average areas in km^ (left ordinate) and MSH (right ordinate) 
for each of the twelve size classes (abscissa), and for three different 
centre-distance ranges: < 30° (diamonds), 30°-60° (squares), and 60°- 
70° (circles). The areas corresponding to the simple pixel areas of the 
size classes without any calibration are indicated by plus signs. The 
direct calibration of size classes by observations from Sestini in 1850 
are shown as triangles. Top: average areas for the data from 1831-1867. 
Bottom: average areas for the data from 1825-1830. An open symbol 
means that the number of spots used in the calculation of the average 
area is less than twenty. Note that the lower curves in the lower panel 
do not imply smaller areas, but they rather mean that any spot of a given 
true area were drawn as a spot of larger class in 1825-1830 than in 
1831-1867. 

it encompassed a considerable fraction of the penumbra at that 
time. The areas of class 3 in the top panel cannot be compared 
with the areas of class 3 in the lower panel. Sunspot sizes cor¬ 
responding to large size classes were much more often used by 
Schwabe in the period of 1825-1830 than afterwards (note that 
there are no longer open symbols with fewer than twenty spots 
in the lower panel). 

The dependence of final areas on the disk-centre distance is 
described by functions of the form 

Jti{6) = a,- + (bil cos (5), (2) 

where a, and bi are coefficients for the i-th size class and 6 is 
the distance of the spot from the centre of the solar disk. In the 
end, there are 12 functions for the period 1825-1830 and another 
12 functions for the period 1831-1867. They deliver a mapping 
of the size classes into physical areas. 
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Fig. 5. The distribution of estimated umbral areas of sunspots. Top: 
The area distribution for 1831-1867. The solid curve shows the log- 
normal fit. The dash ed c urve and dash dot c urve are the fit curves form 
iBogdan et al.llll988h and iKiess et al.l ll2014h . respectively. The horizon¬ 
tal bars show the widths of the bins. The error margins on the distri¬ 
bution values are all smaller than the symbols. Bottom: The area distri¬ 
bution of spots for ascending phases, descending phases, cycle minima 
and maxima of all cycles within 1831-1867. The lines show the log¬ 
normal fits for the corresponding area distributions. 


When computing the areas for the final sunspot data base, the 
area is not calculated if a spot distance is greater than 85° from 
the disk centre, since the area values become very uncertain. All 
spots with 6 < 70° are reliable in the sense that they are covered 
by the statistics leading to the mapping. All spots with 70° < 6 < 
85° are uncertain because they rely on an extrapolation of the 
mapping, while all spots with d > 85° are highly uncertain and 
are therefore excluded from the data base. The smallest sunspot 
area occurring in our data after applying Eq. da is 1 MSH which 
is consistent with the initial lower limit for spots in the reference 
data sets. 

2 . 5 . Distribution of sunspot area 

The data base of Schwabe’s observations contains a total of 
135 921 entries comprising 134386 spots with size estimates 
(each line corresponds to an individual umbra) as well as 
1535 spotless days (each line corresponds to a day with zero 
spot size). Whenever we use the term ‘spots’ we refer to individ¬ 
ual umbrae as far as Schwabe resolved them. Positions are not 


available for 849 umbrae because the orientation of the drawing 
could not be identified. No physical areas are available because 
d is missing. There are another 41 umbrae beyond 85° from the 
centre of the disk for which areas are not calculated because 
of too large uncertainties. The area values are therefore avail¬ 
able for 133 496 umbrae. To study the distribution of areas, we 
consider the spots within +50° central meridian distance (CMD) 
and within ±45° lati tude. The same latitude limit was chosen by 
[Bogdan e tal. ( T988h. The c entral meridian limit, however, was 
±7?5 in Bogdan et al.l(Il988h in order to avoid duplicate counts of 
groups (a group will typically appear only once in a 15° window 
because of the solar rotation). Since drawings by Schwabe are 
not available on all days, we need to widen this window reason¬ 
ably and found +50° a good compromise between not missing 
too many groups due to observing gaps on the one hand and the 
contamination by duplicate consideration of groups on the other 
hand. The l atter will actually not affect the distribution s ignifi - 
cantly, since iBaumann & SolariO (l2005l) an d iKiess et alJ (1201 4t) 
did not see any drastic changes between counting umbrae only 
once and counting them on every day they were visible. The low¬ 
est area considered for the distribution is our lowest area, 1 MSH. 
The above criteria reduce the data to a sample of 104217 spots 
in total, and 96 984 spots from 1831 to 1867. 

The umbral are a spectrum was obtained as described by 
iBogdan et al.l dl988h with the exception that they used a lower 
umbral area limit of 1.5 MSH. Since the distribution is differen¬ 
tial, the different lower limit should not affect the shape of the 
curve. The bins for small areas were selected such that each bin 
encompasses approximately one size class up to class 9, whereas 
one additional bin was defined such that it contains the spots 
from class 10 and about half the spots in class 11. All spots even 
larger than that (> 185 MSH) were outside the above mentioned 
CMD window. Divi ding the area range into twenty bins as in 
IBogdan et al.l (Il988ll would have caused a strong scatter since 
the Schwabe areas accumulate near twelve typical area values, 
because the dependences on the disk-centre distance described 
by (|2]) are all small. 

The area distribution of the Schwabe spots also resembles 
a log-normal distri bution and looks similar to the curve by 
IBogdan et al.l (Il988h . The parameters for such a distribution over 
the area A are obtained through a fit by the function 
\2 




(In A - In {A)r 
2 In cta 


+ ln 




(3) 


where (dA/dA)max is the maximum of the area distribution func¬ 
tion, (A) is the mean, and cta is the geometric standard devia¬ 
tion. Table [T] shows the log-normal fit parameters obtained with 
a Levenberg-Marquardt least-squares method. The cycle minima 
and maxi ma were taken from the “Average” column of the cycle 
timings bv iHathawavl (l20 1 Ol) . Minima periods and maxima peri¬ 
ods are defined as ± 1 yr around the minima/maxima, while the 
ascending and descending phases are the remaining periods. 

The top panel of Fig. |5] shows the resulting total area dis¬ 
tribution of umbrae for 1831-1867. The e rrors on the ordinate 
values were estimated by (AA/AA)/ VaA, where AA/AA is the 
discrete area distribution value and AN is the number of spots 
in each bin. The errors a re all smaller t han th e symbols. The 
lower curve from Fig. 1 in IBogdan et alJ (Il988h . which is the fit 
to th e full range of umb ral areas of 1.5-141 MSH, and the curve 
from iKiess et alJ (1201 4l) are also plotted in Fig.|5]for comparison. 
While the data of both analyses also influence our size calibra¬ 
tion of Schwabe’s sun s pots, t he distribution only agrees with the 
one by IBogdan et al.l (Il988h based on Mt. Wilson data. Inter¬ 
estingly, also the area distributions obtained from group umbral 


Article number, page 5 of[l3] 























































































A&A proofs: manuscript no. schwabe_tilt 


Table 1. Log-normal fit parameters for the Schwabe data and various 
subsets of them. 


Data 

Umbrae 

<A> 

[MSH] 

O-A 

[MSH] 

/dVt 

V ^ / max 

[MSH-1] 

All data (1825-1867) 

104217 

1.05 

3.8 

3.8 

1825-1830 

7233 

0.58 

9.9 

1.7 

1831-1867 

96984 

1.10 

3.5 

4.2 

Cycle 7 

9448 

1.09 

5.3 

1.3 

Cycle 8 

22 382 

1.08 

3.8 

4.0 

Cycle 9 

36 862 

1.08 

3.0 

5.3 

Cycle 10 

35181 

1.10 

3.4 

4.9 

Ascending phases 

13613 

1.09 

3.5 

3.7 

Descending phases 

46763 

1.10 

3.1 

4.7 

Cycle minima 

3 507 

1.08 

3.4 

1.4 

Cycle maxima 

31131 

1.10 

3.6 

7.6 



1820 1830 1840 1850 1860 1870 

Year 


areas dBaumann & Solan^l2005l) agree fairly well with our re¬ 
sults. The peak position (A) of their log-normal distribution from 
the Greenwich group data is ten times larger than ours from in¬ 
dividual spot data, in good agreement with the fact that sunspot 
groups consist of roughly ten spots on average. 

The bottom panel of Fig. |5] shows four individual area dis¬ 
tributions for the ascending and descending phases of the cy¬ 
cles as well as for the cycle minima and maxima. They ex¬ 
hibit nearly the same distribution as the whole area distribu¬ 
tion. The descending phases of all cycles have larger (dAf/dA)inax 
than the ascending phases, however. The fairly small variation 
of (Ta is rema r kable and confirms the correspo nding findings by 
[Bogdan et al.l (Il988b and lSchad & PennI(|2010|) . 

The umbral area distribution was calculated for the data from 
1825-1830 and from 1831-1867 separately, and the fit parame¬ 
ters are given in Table [1] The distribution for the ea rlier period 
is muc h wider, with cta being similar to the one by iKiess et ^ 
(l2014l) who derived oa — 10.7 (cr = 1.54 in their work). 

3. Group definitions 

Tilt-angles of groups will be sensitive to the actual association 
of spots into groups. Schwabe’s original drawings contain group 
names which he started from number one every new year. His 
perception of a group was often too broad. A fair number of 
sunspot clusters actually contain two or more groups. This new 
definition of the groups was made by manual inspection of the 
drawings. We also used the evolutionary information of the clus¬ 
ters and sub-clusters provided by the images of adjacent days. 
Very often, a small apparently new bipolar group emerged near 
an existing one and showed its individual evolution through the 
Waldmeier (or Zurich) types. Schwabe included them in the 
group number of the existing group, while we defined a new 
group in many such cases. In other cases, when splitting of 
the polarities was not obvious and the parts of the group were 
all in the same evolutionary phase, we kept Schwabe’s defini¬ 
tion, despite leading to somewhat large groups of 30° extent or 
more. Any splitting, however, would have been very arbitrary 
and would add noise rather than new information to the tilt angle 
data base. A total of 56 groups with extents > 30° size remained. 

Figure |6] shows the ratio of the number of newly formed 
groups, i.e. those obtained by splitting Schwabe’s original 
groups, to the original number of groups. For instance, if there 
were a total of 10 groups which were all split into two, resulting 
in 10 new groups, the ratio would be one (100% splitting). The 


Fig. 6. The number of new groups obtained by splitting Schwabe’s orig¬ 
inal groups, normalised to the number of groups before splitting (i.e. 
Schwabe’s original groups). The criteria for splitting a group are de¬ 
scribed in the main text. 



Year 


Fig. 7. Annual averages of the number of spots per group calculated be¬ 
fore regrouping of sunspot groups (circles), after regrouping (triangles), 
and manually counted number of spots from the magnification drawings 
of sunspot groups (diamonds). Uncertainties are only given for the val¬ 
ues after regrouping and are obtained from the relative Poissonian error 
1 / V”couiits. where Ucounts are the number of all instances of all groups 
in a given year (groups count several times with different numbers of 
spots). 

large fraction of splittings seen after 1850 is partly due to the 
presence of very many closely located groups, but chiefly due 
to a wider definition of groups Schwabe adopted during those 
cycles. In nine cases, Schwabe assigned one group designation 
each to two spots, while they apparently form one bipolar group. 
We combined those cases to one group name. 

The splittings are marked by modified group names in the 
above mentioned data file. The two groups 116 and 117 in Fig.[T] 
will now appear as four groups, 116-0, 116-1, 117-0, and 117-1 
in the catalogue. Combined groups are marked with plus signs 
in the new group name, e.g. 39 h-40. 

To demonstrate the impact of the regrouping (splitting as 
well as combining), Fig.|7]shows the annual averages of the num¬ 
ber of spots per group, derived from Schwabe’s group definitions 
and derived after regrouping by visual inspection. Additionally, 
the sunspot group magnifications drawn by Schwabe (see Fig.|2l) 
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Table 2. Statistics of coverage, spot areas, groups, and tilt angles a for Schwabe’s observations in 1825-1867. CMD is tbe central meridian 
distance of tbe area weighted centre of a group. 


Period 

Days with 
drawings 

Gaps longer 
than 5 days“ 

Gross groups 
with areas 

Unique groups 
with areas 

Groups 
with tilt 

Group tilts with 
|CMD| < 60° 

Group tilts with 
|CMD| < 60° 
and a > 3° 

1825-1830 

1187 (63.0%) 

32 

4401 

945 

2452 

2159 

1745 

1831-1867 

8808 (65.2%) 

149 

27116 

5903 

20689 

17 365 

13 803 

Total 

9995 (64.9%) 


31517 

6848 

23141 

19524 

15 548 


Notes. Tbe gaps are derived only from tbe days for wbicb we obtained data; if we include tbe unused drawings, tbe number of gaps longer than 
five days is a bit smaller than tbe number given in tbis column. One group is missing in tbe 1825-1830 number because a gap straddles 1830 
and 1831. 




Fig. 8. Top: Annual averages of total-disk umbral area of sunspots in 
MSH and divided by three (solid line), the group sunspot number (GSN) 
derived from our groupings (dashed line), the sunspot number (SSN; 
Wolf number) derived from our groupings and the actual number of 
spots in the full-disk drawings (dash-dotted line), and the official Inter- 
na tional Sunspot Num ber (dotted line) of before the revision suggested 
bv IClette et al.l l l2014h which was not yet available. The gray horizon¬ 
tal bar indicates the period in which the number of spots per group was 
fairly constant (cf. Fig.|7}. Bottom: Daily total umbral areas versus GSN 
(in case of Schwabe, the GSN is based on our grouping in Schwabe’s 
drawings, corresponding to the dashed line in the upper panel), averaged 
over 100 values in each group sunspot number bin for the Schwabe data 
(circles) and the Greenwich data (asterisks). The solid and dashed lines 
are linear fits through the Schwabe and Greenwich data, respectively, 
being enforced to go through the origin. 


were used to compute a third set of average spot numbers per 
group. Since the magnifications are not biased towards excep¬ 
tionally large groups until 1830, we give only the averages for 
1826-1830. These spot numbers per group obtained from de¬ 
tailed drawings_of individual groups match modern values very 
well (lTlatovl2013HClette et all2014l) . After 1830, only selected, 
big groups were magnified, so the values of spots per group are 
biased. 

In the averages derived from the full-disk drawings, there is a 
significant increase in the number of spots per group from 1830 
to 1836. This increase was not flattened after the regrouping of 
sunspot groups. It may be partly due to initially smaller true 
numbers of spots per group and partly due to the early, coarser 
drawing style of Schwabe. The most notable jump from 1835 
to 1836 does not coincide, however, with the change in drawing 
style in 1830/1831. An increase in the number of spots per group 
can also be seen after the other minima, namely in 1843-1847, 
1856-1858, and 1866-1867. Therefore, the change in drawing 
style and the recovery from the activity minimum in 1833 are 
probably superimposed effects. 

By the same token, we may spot small peaks coinciding with 
the solar cycle maxima 8, 9, and 10 in Fig.|7] This effect has also 
been observed - even more drastically - by IClette et alJ (l2014l) 
in 20th-century data, and it may actually be a mixture of a real 
effect and observational bias (basically because on a crowded 
Sun, the splitting of groups is difficult). 

The upper panel of Fig. 0 shows the annual averages of um¬ 
bral areas of sunspots. They are compared to the yearly averages 
of the group sunspot number (GSN) according to our own group 
number information and the (Wolf or Zurich) sunspot number 
(SSN), both derived from Schwabe’s observations, as well as to 
the International Sunspot Number (ISN)[3 Good agreement is 
found between umbral areas and Schwabe’s group sunspot num¬ 
ber for cycle 8, while the areas of cycles 7 and 9 fall below the 
(rescaled) sunspot numbers and cycle 10 has larger areas than the 
sunspot numbers indicate. Since Schwabe’s observing method, 
telescope and drawings are very constant after 1835, the differ¬ 
ence between the ISN and the Schwabe record may be due to 
calibration issues of the ISN before 1849 (iLeussu et al.ll2013l) . 

In an attempt to assess the correlation of the umbral areas 
with the group sunspot number, we plot averages of 100 daily 
all-disk umbral areas versus the corresponding group sunspot 

' Note that the GSN includes a scaling of 12.08 derived from the com¬ 
parison of the ISN with the groups found i n the Greenwich Photoheli¬ 
ographic Database dHovt & Schattenll 19981) . Since the ISN was scaled 
down to match Wolf’s observations, who recorded about 60% of the 
sunspots that would be reported today, the SSN from Schwabe’s data 
can actually lie above the GSN. This is the case when Schwabe’s draw¬ 
ings are bit more detailed than Wolf’s reports. 
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number of the same days in the lower panel of Fig. [8] The same 
was done for the Greenwich photoheliographic database which 
co ntains umbral areas unti l 1976. The graph is similar to the one 
bv iBalmaceda et aP (l2009t) who used the total spot areas instead. 
There is an intrinsic scatter in the correlation due to a certain 
randomness if both the sunspot number and sunspot areas are 
related to an internal magnetic field, rather than to each other. 
The uncertainty from the randomness has been reduced to about 
10% by averaging over 100 days. Although the scatter in total 
umbral areas is higher than that due to randomness (as seen in 
the bottom panel of Fig. EJl, it is comparable to the results from 
the Greenwich data, except for a slight tendency to larger areas 
(lines of linear fits through the origin were added for clarity in 
Fig. [8]|. We therefore conclude that the areas inferred from the 
Schwabe data are compatible with the Greenwich data, which 
did not enter our calibration at any step. The Schwabe areas do 
show, however, cycle-to-cycle variations in the strength of the 
correlation with the sunspot number indices. 

The final total numbers of groups as well as the numbers of 
groups that will have tilt angles (see Sect. [Hi are given in Table|2l 



Fig. 9. Histogram of the ‘tilt angles’ of supposedly unipolar groups in 
the Schwahe data, selected by a maximum ‘polarity’ separation of A/i < 
1?5. 


4. Tilt-angles of groups 

4.1. Determination of tiit and separation 

The tilt angle of a given sunspot group is calculated in a plane 
tangential to the solar surface in an estimated mid-point of that 
particular group to avoid problems with the curvilinear helio¬ 
graphic coordinates. The mid-point of the group (hereafter box- 
centre as opposed to the area-weighted centre of gravity of the 
group) is obtained using the easternmost and westernmost spots 
as well as the northernmost and southernmost spots of a given 
group. The longitude and latitude of the box-centre is set to be 
the contact point of the tangential plane with the solar surface. 
The Cartesian coordinates in this plane are x, and y,- for the /-th 
spot and Xg and for the box-centre and are normalised with 
respect to the solar diameter. 

The algorithm then checks for the number of spots in a 
group. If it is equal to two, it proceeds to calculate directly the 
tilt angle and polarity separation. If it is more than two spots, we 
have to assess the most probable configuration of which spots be¬ 
long to which polarity, since magnetic information is not avail¬ 
able. We look for the most probable division of the group into 
two clusters by finding the least positional variance within the in¬ 
dividual clusters. For that, we let a division line, running through 
the box-centre, rotate from 0 = 0 to 0 = 180'’ and obtain - for 
each angle - a cluster of ‘leading’ spots and a cluster of ‘fol¬ 
lowing’ spots. This is achieved by using a vector perpendicular 
to the division line, D = (cos 0, sin 6). The sign of the inner 
product of this vector with the spot vector, S, = (x, - Xg,y/ - yg) 
defines the cluster (‘polarity’) membership of the i-th spot. The 
sum of the two variances of the spots’ coordinates on either side 
of the division line is calculated. We denote the angle at which 
the least positional variance is achieved by 0opt and adopt it as 
the most probable division of the sunspot group into polarities. 
The area-weighted centres of the polarities found are then calcu¬ 
lated. The eastern and western parts correspond to the following 
and leading polarities, respectively, and their coordinate pairs are 
denoted by (xF,yF) and (xL,yL)- The coordinate pairs convert to 
heliographic coordinates (0 f, Tf) and (0 l, Tl), respectively. 

The tilt angle a is computed by 


(yp - w) / (xl - Xp) if Tg > 0 

O'L - yp) / (xl - Xp) otherwise. 


(4) 


where Tg is the heliographic latitude of the box-centre. The tilt 
angles are positive if the leading polarity is nearer to the equator. 
Note that these operations are done in a tangential plane through 
the box-centre. Problems with measuring on a spherical surface 
are thus very small. The tilt angles calculated are called ‘pseudo- 
tilt-angle’ as in case of the Mt. Wil son data becau se magnetic 
polarity information is not available (iHowar IIIMI). 

The polarity separation is then computed on the great circle 
(orthodrome) through the polarities: 

cos A/3 = sin /If sin Ap + cos /If cos Ap cos((f>p - (f>p). (5) 

The tilt angles are calculated for all spot groups with two or more 
spots. 


4.2. Sources of errors 


Differences between various data sources and tilt angle determi¬ 
nations may have the following origins: 

- unipolar groups are assigned a tilt angle erroneously, 

- the method of computation may introduce a bias if the divi¬ 
sion angle through the bipolar group is presumed or preju¬ 
diced, 

- ambiguity of the tilt angle sign due to the lack of magnetic 
information, 

- or incorrect splittings or combinations of groups lead to spu¬ 
rious tilt angles. 


The misidentification of unipolar regions generates a noise com¬ 
ponent whose distributio n is much shallower than the dis tribu- 
tion from bipolar groups (I Wang et al.ll2015^ Baranvill2015h. T he 


peak-to-tail ratio apparent from figure 8 in I Wang et al.l (1201511 is 
about six for unipolar groups with A/3 < 2?5, while it is roughly 
100 for the groups with A/3 > 2°5. In the Schwabe data, we find a 
peak-to-tail ratio of a bit more than two for A/3 < 2?5, while it is 
entirely uniform for A/3 < 1?5 (Fig.|9]l. An interesting exercise is 
the determination of the dependence of the average tilt angle on 
the level of noise that is typically introduced by unipolar sunspot 
groups that are erroneously included when no magnetic informa¬ 
tion is available. We assume that the true tilt angle distribution is 
symmetric around its mean and denote it by S (or, ao, ctq,), where 
ao and ctq, are the true average and width of that distribution, 
respectively. We add the noise as a simple background constant 
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C which mimics the contamination by unipolar groups or other 
spurious tilt angles to simplify the analysis, giving an upper limit 
of the error introduced by those tilt angles. The average tilt angle 
is then 

pnjl I pKjl 

<Q')= a(C + S)da (C + S)da. (6) 

J-71/2 I J-rr/2 


We can replace the constant C by the number of sunspot groups 
that contribute to C as a fraction of the total number of groups 
and denote this fraction by /. We find 


>r(l-/) 


Inserting this into ® leads to 


(7) 



-50 0 50 

Tilt angle (deg) 


nnl2 I nnl2 

{a) = (1 -/) aSda / Sda 

Jn/2 I J 7 T /2 

^ (l-f)ao. (8) 

The approximation coming from the finite-limit integral is bet¬ 
ter than 0?05 up to ao = 42° for ctq, = 20° and up to 15° for 
(Ter - 30° - good enough for any relevant average tilt angles. 
The relation ([8]) tells us that if 10% of the individual tilt angles 
are spurious, the average tilt angle reduces by 10%, e.g. from 
a true value of 5° to a measured v alue of 4?5. Looki ng at the 
cycle-to-cycle variations derived bv lWang et al.l (1201 5h for indi¬ 
vidual data sets, we infer cycle-to-cycle scatters (corrected by 
f-distribution) of 4%, 9%, and 14% for the Debrecen umbral- 
based tilt angles, the Debrecen whole-spot-based tilt angles, and 
the tilt angles from the Mt. Wilson white-light images, respec¬ 
tively. Since the cycle-to-cycle variations of the average tilt an¬ 
gles are that small, the possible contamination of the distribution 
should be assessed, especially when data from different sources 
are combined. Note that 100% noise naturally leads to an aver¬ 
age tilt angle of 0° (without polarity information). The influence 
of unipolar groups can be reduced significantly by excluding all 
groups with apparent separations of AB < 2°5 or even Afl < 3° 

We tried to further reduce the influence of spurious tilt an¬ 
gles by looking at the scatter of tilt angles of a given group dur¬ 
ing its evolution over several days. We denote the individual ap¬ 
pearances of a group over seve ral days as “ i nstanc es”. A simi¬ 
lar procedure was proposed by iLi & Ulrichl (l2012h . On the one 
hand, outliers due to ill-defined groups need to be removed. On 
the other hand, groups become unipolar at the end of their life¬ 
time, but are still large and accompanied by pores, mimicking 
Af5 > 3°. We therefore determine the median tilt angle d from 
the various instances of a given group and determine the average 
deviation from it by 

/ 

Aa = Y^\ai-a\/I, (9) 

1=1 

where I is the number of instances of the group and a, are the 
tilt angles of the individual instances of the group. The tilt an¬ 
gle will be fairly reliable if the polarity separation is large. We 
therefore tested whether the tilt angle at maximum polarity sep¬ 
aration, atnaxsep, does not deviate from the median significantly, 
using the criterion lamaxsep - d\ < Aa. The groups fulfilling this 
criterion are a good guess of the ‘real’ bipolar groups, while oth¬ 
ers are omitted entirely. Now, within the accepted groups, all 
instances with \ai - d| > 2Aa are omitted as outliers. A group 


Fig. 10. Histograms of the tilt angles in the Schwabe data, analysed with 
two different methods. Light bars show the excess of tilt angles from 
our isotropic method searching for the optimum polarity divisi on, while 
dark b ars show the excess of tilt angles from the method bv iHowardI 
l ll991h . Only groups with area weighted centres within ±60° CMD and 
polarity separations A/3 > 3° are used. 


turning unipolar near the end of its lifetime still exhibits scat¬ 
tered spots and pores around the remaining (large-area) polarity 
which will cause quickly changing spurious tilt-angles. Those 
(mostly H-type) groups are not supposed to deliver a tilt angle. 
We will call those cases ‘evolutionary outliers’ in the following. 

The removal of ‘evolutionary outliers’ also requires the deci¬ 
sion on which hemisphere a given group lay, since low-latitude 
groups may have instances on both sides of the equator, lead¬ 
ing to jumps in the tilt angle. We decided upon the hemispheric 
‘membership’ by the average (signed) latitude of the group in¬ 
stances of each group. The signs of the tilt angles of all in¬ 
stances are then computed assuming the single hemisphere ob¬ 
tained from that average latitude, regardless of the actual hemi¬ 
sphere of an individual instance. 

With respect to the method of computing tilt a ngles, we used 
the Sc hwabe data set to compare the method by [Howard et al.l 
(Il984h with our method. The former divides groups always with 
a north-south line leading to a bias with an avoidance of tilt an¬ 
gles near 90°. Our method of trying all possible dividing lines 
is isotropic with equal prior probabilities for tilt angles of 90°, 
0°, and -90°. Figure [T0]shows a comparison of the two methods 
based on the S chwabe data of individual spots. The method of 
iHowardI (Il99lh tends to concentrate tilt angles at lower values. 


Groups which are ‘reversed dipoles’ as compared to the 
typical polarity of a given cycle (anti-Hale groups) cannot be 
detected in white-light images or sunspot drawings. The anti- 
Hale fraction of all gr oups is about 8 % (iLi & Ulrichl 20121: 
McClintock et al.l 1201^ or about 5% (Sokoloff & Khlvstoval 


20101) or even lower ( Sokoloffet al.l 1201 sl) . based on magne- 


togram data. This fraction holds true for large bipolar regions 
though, while the fraction may be as hig h as 50% for ephemera l 
regions with areas less than 50 MSH (llllarionov et al.l 12015h . 
which are not relevant here as they are not accompanied by 
sunspots. In our cleanest distribution without ‘evolutionary out¬ 
liers’, 95% of the groups are larger than 10 MSH in terms of their 
umbral group area. This translates to roughly 100 MSH total 
group area. For this lower limit of group areas, llllarionov et al.l 
(l2015b give an anti-Hale fraction of about 37%, while for 
large groups with 500 MSH or more, that fraction is 10% or 
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Fig. 11. Histograms of the tilt angles of groups with area weighted cen¬ 
tres within 60° central meridian distance. Open bars: entire set, filled 
bars: groups with a minimum polarity separation of A/^min = 3°, and 
hatched bars: groups with = 3° and a removal of ‘evolutionary’ 

outliers that occur in the sequence of tilt angles during the evolution 
of any given group. This hatched histogram is an attempt to further re¬ 
duce the influence of occasional unipolar instances of otherwise bipolar 
groups. See text for the detailed algorithm. The bin width is 2?5. 


Fig. 12. Tilt angle histogram of the Schwabe data for group centres 
within 60° central meridian distance and polarity separations Afi > 3°. 
In this analysis, only spots with umbral areas of 5 MSH or more were 
used to compute the weighted positions of the polarities. The selection 
therefore affects both tilt angles and polarity separations. As compared 
with the hatched bars in Fig. [TT] many groups have now turned into 
unipolar groups, since one of the polarities was represented by a single 
spot of less than 5 MSH. 


less. The unsigned tilt angle distribution of anti-Hale groups is 
broader than that of the Hale groups, but with similar peaks 
(iMcClintock et al.ll2014ll . The influence of the missing knowl¬ 
edge of the polarity on the average tilt angle is therefore rel¬ 
atively mild and not as strong as a similar fraction of random 
noise in the data. 

The definition of wh at exactly is a group yields another 
source of possible errors. I^aranvil ilTOLSh revisited the Mt. Wil¬ 
son and Kodaikanal data sets and compared them with Debre¬ 
cen tilt angles. Among other things, she found that the auto¬ 
mated routine used in the original analysis of the Mt. Wilson 
and Kodaikanal data often splits tr ue group s into two smaller 
ones. While the average tilt angle by [Howara (1 199 Ih (4?2 + 0?2) 
was reproduced as 4?16 + 0.19, a higher value of 4?69 + 0?20 
was found for 1917-1976 when these extra splittings were cor¬ 
rected. In terms of Eq.|9] this new value indicates a random noise 
fraction in the original value of 11%. That period of 1917-1976 
is the one for which the Greenwich Photoheliographic Database 
was used as a reference to define proper groups. For 1974-1985, 
the Debrecen Photoheliographic Database was used to obtain the 
‘true groups’, to give the average of 5?00 + 0?47. Note that the 
difference to 4?69 + 0?20 may actually be real and due to the 
different cycles covered. 

4.3. Distribution and averages 

Figure [TT] shows the resulting distribution of tilt angles. Being 
quite broad, the distribution has its maximum at small, non-zero 
a. Note that the open bars may include tilt angles which are er¬ 
roneously computed for two or more spots of a single polarity, 
fragmented spots, and spots inside the same penumbra. The filled 
histogram therefore shows the distribution of tilt angles with 
polarity separations A/3 > 3°. In this distribution, spots inside 
a common penumbra are essentially excluded, but, on the one 
hand, true bipolar groups with very small polarity separations 
may also be excluded. On the other hand, spurious tilt angles due 
to a decaying big group with a single polarity may still contribute 


to this distribution, but are a very minor fraction (lBaranvil2015h . 
The selection of bipolar groups may perhaps be fine-tuned by 
using an area-dependent minimum polarity separation, but we 
did not wish to impose biases which may affect the distribution 
of polarity separations. A constant minimum separation A/SaCm is 
therefore used for selecting actual bipolar groups. 

The average tilt angle for the distribution with AySmm = 3° 
and with area weighted group centres within +60° CMD is 
4?45 + 0?20 (median 4?69 ± 0?20) where the error of the mean 
is computed by the standard deviation of the distribution di¬ 
vided by the square root of the number of points, ctiaiI sjn. For 
comparison, we may also compute the average tilt angle for 
AjSmin = 0 (again |CMD| < 60°) and obtain 4?01 ± Q°.22 (me¬ 
dian 4?55 + 0?22). The lower value is consistent with our earlier 
supposition that spurious bipolarities add a certain amount of 
randomness to the data, bringing the average closer to z ero. This 
averag e agrees relativel y well with the ones fou nd by [HowardI 
([MB (4?2 + 0?2) and lPasi-Fspuig et al.l (1201 Oh (4?25 ± 0?18 
for Mt. Wilson and 4?51 + 0?18 for Kodaikanal) for solar cy¬ 
cles 15-21 which were all computed without a lower limit, i.e. 
AySmin = 0. 

An analysis of the Debrecen data by iBaranvil (12015h with 
careful extraction of truly bipolar groups delivered 5?12 + 0?46 
for 1974-1985 (end of cycle 20 and cycle 21). Based on a mini¬ 
mum polarity separation of A/SaCm - 3°, we recomputed the Mt. 
Wilson and Kodaikanal averages as well which resulted in val- 
ues of 5?95 ± 0?42 and 6?91 ± 0?45, respectively, for cycle 21. 
Ilvanovi (1201 2h used the Pulkovo database (Catalogue of Solar 
Activity - CSA) for tilt angles in the period 1948-1991. We used 
their database and obtained an average tilt angle of 6?41 + 0?14 
for cycle 21 only. In this sample, the groups were determined 
manually and contain a fairly clean definition of what a group 
is, similar to our analysis of the Schwabe drawings. Since the 
average tilt angle varies from one cycle to the next, we cannot 
compare Schwabe’s tilt angles with 20th-century ones directly, 
but we And that averages of clean tilt angle samples are typically 
5° or larger. 
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Fig. 13. Dependence of the average tilt angle on the unsigned latitude 
for the data from Schwabe, Kodaikanal, Mt. Wilson, and Debrecen. 
Only groups with central meridian distances within ±60° are used as 
the simplest common filter for all datasets. Errors of the means are the 
standard deviation of the bin sample divided by ^fn in each bin. 


The resulting histogram of tilt angles with deleted ‘evolu¬ 
tionary outliers’ (Sect.|4j2| is shown in Fig.[TT]as hatched bars. 
The average tilt angle has risen to 5?85 ± 0?25, based on 7765 tilt 
angles. We consider this the cleanest sample of tilt angles for 
cycles 7-10. The only average this value can be compared with 
now (as f ar as diff e rent c ycles can be compared at all) is the one 
given by iBaranvil (1201 5h where bipolar groups have also been 
selected fairly rigorously. However, that Debrecen tilt angle of 
5°12 is different in that spots smaller than 5 MSH are considered 
as pores which do not enter the determination of tilt angles and 
polarity separations. Figure [12] shows the histogram of the tilt 
angles restricted to spots that have areas > 5 MSH. The aver¬ 
age of 5?45 ± 0?34 (based on 4154 tilt angles) is slightly lower 
than the above value of 5?85 + 0?25, but not significantly. Even 
though the pores seem to play a minor role in determining reli¬ 
able tilt angles, the difference shows that the minimum umbral 
area needs to be considered when combining several data sets. 

The average dependence of the tilt angle on the absolute he¬ 
liographic latitude (Joy’s law) is shown in Fig.[T3| Together with 
the Schwabe data, we also plotted the average tilt angles obtained 
from the Mt. Wilson, Kodaikanal, and Debrecen data, using only 
tilt angles from groups within ±60° CMD. The latitudinal depen¬ 
dence in the Schwabe data may be a bit shallower than the de¬ 
pendences from the other data sets - especially since it exhibits 
a non-zero intersection with the ordinate -, but is in agreement 
with them considering the uncertainty margins. 

The format of the database containing the individual umbrae 
including umbral areas as well as s potless days is gi ven in Ta- 
ble|3]which extends the one given bv lArlt et alJ (1201 3l) . while the 
format of the final data set containing the tilt angles and polarity 
separations of sunspot groups is given in Table |4| Note that we 
give both, the tilt angles obtained according to our method de- 
sc ribed above as w ell as the tilt angles derived with the method 
byE owar ^ (Il991h in the latter table. 

5. Summary 


computing tilt angles of these sunspot groups for that period. 
The fraction of the solar disk covered by the pencil dots in the 
drawings cannot be directly converted into an area in km^ or mil¬ 
lionths of a solar hemisphere (MSH). We therefore constructed a 
mapping of the t welve arbitrary cursor sizes which were used by 
lArlt et all (1201 3h to estimate the sizes of sunspots in the Schwabe 
drawings. For cycles 8-10, we obtain an average umbral area per 
day of 113. The Debrecen data for cycles 21-23, which were pre¬ 
dominantly used for calibration, yield an average of roughly 150. 
The difference appears to be compatible with the stronger cycles 
in the second half of the 20th century and the fact, that Schwabe 
may have overlooked (or not plotted) a number of small spots. 
The umbral areas in the Greenwich Photoheliographic Database 
lead to an average of about 140 from cycles 12-20 of mixed 
strengths. Our area conversion is independent of the Greenwich 
data, but seems to agree with it fairly well, again taking into ac¬ 
count that the Schwabe drawings may miss a few smaller spots. 

The area distribution of the Schwabe sunspots exhibits a 
log-normal distribut ion in agreement with 20th-century data 
dBogdan et al.lll98^ and is essentially independent of the cy¬ 
cle phase. Schwabe’s original sunspot group designations were 
modified so that the groups comply with the modern understand¬ 
ing of a sunspot group, with the limitation of missing mag¬ 
netic information. Using the positions and areas of all individual 
sunspots, the tilt angles as well as the polarity separations of the 
sunspot groups were calculated. Without the magnetic informa¬ 
tion, the definition of the polarities may lead to wrong associa¬ 
tions affecting both tilt angles and polarity separations. The man¬ 
ual inspection of the groups before computing these quantities 
reduces these incorrect polarities as compared to fully automatic 
analysis schemes. Nevertheless, a remaining random component 
in the tilt angle distribution is likely to be present. 

Both an updated sunspot database and a tilt angle database 
are available at http://www.aip.de/Members/rarlt/sunspots for 
further study. We note that in the sunspot database: 

- sunspot areas for spots with CMD < 70° are reliable, 

- sunspot areas for spots with 70° < CMD < 85° are uncertain 
because they are an extrapolation of the statistical method 
employed, 

- sunspot areas for spots with CMD > 85° have been omitted, 

- sunspot areas are calibrated using 20th century data; they 
will not serve for the purpose of detecting differences in ar¬ 
eas between the 19th and the 20th century. 

In the tilt angle database: 

- tilt angles for all groups with two or more spots are reported, 

- tilt angles for groups with CMD < 60° are considered reli¬ 
able as the positions are reliable, 

- tilt angles for polarity separations Aj6 > 3° (POLSP) are 
likely to be bipolar groups and should be selected for further 
analysis. 

- the influence of spurious tilt angles from remaining unipolar 
groups can be further reduced by removing outliers from the 
sequence of tilt angles provided by the evolution of a given 
group. In brief, we removed days of appearance of a given 
group if the obtained tilt angle deviates significantly from 
the mean tilt angle of all appearances of that single group. 
The actual procedure is a bit more involved and is described 
in Sect. 14.21 


This study aims at determining physical areas of sunspots from 
drawings by Samuel Heinrich Schwabe in 1825-1867 as well 
as ordering these sunspots into (hopefully bipolar) groups and 


Joy’s law was found to be obeyed by the likely bipolar groups. 
The latitude dependence averages over all Schwabe cycles is not 
significantly different from cycles in the 20th century. 
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Table 3. Modified format of the data of individual sunspots observed by Samuel Heinrich Schwabe. The format extends the one by I Arlt et al.l 
ll201.3h after column 93. In the Format column, I denotes integer fields, C8 is an 8-character text field, and, e.g., F5.1 denotes a 5-character-wide 
floating point field with one decimal. Areas in UMB are based on the derivation by Q. 


Field 

Column 

Format 

Explanation 

YYYY 

1^ 

14 

Year 

MM 

6-7 

12 

Month 

DD 

9-10 

12 

Day referring to the German civil calendar running from midnight to midnight 

HH 

12-13 

12 

Hour, times are mean local time in Dessau, Germany 

MI 

15-16 

12 

Minute, typically accurate to 15 minutes 

T 

18 

11 

Indicates how accurate the time is. Timeflag = 0 means the time has been inferred by the 
measurer (in most cases to be 12h local time); Timeflag = 1 means the time is as given by the 
observer 

LO 

20-24 

F5.1 

Heliographic longitude of apparent disk centre seen from Dessau 

B® 

26-30 

F5.1 

Heliographic latitude of apparent disk centre seen from Dessau 

CMD 

32-36 

F5.1 

Central meridian distance, difference in longitude from disk centre; contains -.- if line indi¬ 
cates spotless day; contains NaN if position of spot could not be measured. 

LLL.L 

38^2 

F5.1 

Heliographic longitude in the CatTington rotation frame; contains -.- if line indicates spotless 
day; contains NaN if position of spot could not be measured. 

BBB.B 

44-48 

F5.1 

Heliographic latitude, southern latitudes are negative; contains -.- if line indicates spotless 
day; contains NaN if position of spot could not be measured. 

M 

50 

Cl 

Method of determining the orientation. ‘C’; horizontal pencil line parallel to celestial equa¬ 
tor; ‘H’; book aligned with azimuth-elevation; ‘Q’: rotational matching with other draw¬ 
ings (spot used for the matching have ModelLong + ‘ - ModelLat + ‘ - and 

Sigma ‘ - .-'). 

Q 

52 

11 

Subjective quality, all observations with coordinate system drawn by Schwabe get Qual¬ 
ity = 1. Positions derived from rotational matching may also obtain Quality = 2 or 3, if the 
probability distributions fixing the position angle of the drawing were not very sharp, or 
broad and asymmetric, respectively. Spotless days have Quality = 0; spots for which no po¬ 
sition could be derived, but which have sizes, get Quality = 4. 

SS 

54-55 

12 

Size estimate in 12 classes running from 1 to 12; a spotless day is indicated by 0 

GROUP 

57-64 

C8 

Group designation based on Schwabe, but modified by our regrouping 

MEASURER 

66-75 

CIO 

Last name of person who obtained position 

M0D_L 

77-81 

F5.1 

Model longitude from rotational matching (only spots used for the matching have this) 

M0D_B 

83-87 

F5.1 

Model latitude from rotational matching (only spots used for the matching have this) 

SIGMA 

89-93 

F6.3 

Total residual of model positions compared with measurements of reference spots in rota¬ 
tional matching (only spots used for the matching have this). Holds for entire day. 

DELTA 

96-99 

F4.1 

Heliocentric angle between the spot and the apparent disk centre in degrees (disk-centre dis¬ 
tance); it is -.- for spotless days, while it is NaN if the spot position could not be determined. 

UMB 

100-103 

13 

Inferred umbral area in millionths of the solar hemisphere (MSH); it is 0 for spotless days 
and NaN if spot position could not be derived or DELTA > 85° 

A 

105 

Cl 

Flag saying whether area mapping is based on umbral (‘U’) or penumbral (“!’) areas with 
the latter being less certain. Note that the actual area given in UMB is always umbral. Spotless 
days have -. 


The applicability of w hite-light iinages fo r inferring cycle 
properties was doubted by IWang et ^ (l2015h . mostly because 
of the inevitable contamination by actual unipolar groups. We 
believe the white-light images and drawings can still be useful, 
because (a) improved algorithms and visual inspection can re¬ 
duce the impact of unipolar groups significantly, and because (b) 
we are interested in relative variations of tilt angles from one 
cycle to another, so consistently analysed sunspot data can still 
provide useful relative information of cycle-to-cycle variability. 
Care has to be taken that all data used for a particular study are 
consistent with each other, e.g. tilt angles from white-light im¬ 
ages and magnetograms cannot be combined directly into a sin¬ 
gle record. 

We have not studied the tilt angles of the individual cycles 
in this Paper. This will be the subject of a future study of tilt 
angles and strengths of invidual cycles as well as correlations 
thereof, extending ealier works on Mt. Wilson and Kodaikanal 
data. Especially cycle 7 is an interesting candidate for peculiari¬ 


ties, since it occurred shortly after the Dalton minimum (roughly 
1795-1820). The current Paper aims at the disemination of the 
areas and tilt angles as the basis for further studies. 
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Table 4. Format for the tilt angle data derived from the sunspot groups observed by Schwabe, with format symbols as in Table (3] All areas are 
based on the UMB column in Table [S] 


Field 

Column 

Format 

Explanation 
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1-4 

14 

Year 

MM 

6-7 

12 

Month 

DD 

9-10 

12 

Day 

HH 

12-13 

12 

Hour 

MI 

15-16 

12 

Minute; mean local time in Dessau, Germany 

GROUP 

18-25 

C8 

Group name based on Schwabe, but modified by our regrouping 

SP 

27-28 

12 

Number of spots in a group 

ARA 

30-32 

13 

Sum of umbral area of all spots in a group, in millionths of the solar hemisphere (MSH) 

AWL.L 

34-38 

F5.1 

Area-weighted heliographic longitude of the group 

AWB.B 

40-44 

F5.1 

Area-weighted heliographic latitude of the group 

TILTAN 

46-51 

F6.2 

Tilt angle of the group; positive sign means leading polarity closer to equator in either hemi¬ 
sphere. This tilt angle was found using an isotropic search for the most likely dividing line 
between the polarities. 

TILTHO 
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F6.2 

Tilt angle computed as inlHowardl(ll991h for comoatibilitv reasons. It is based on a fixed vertical 
dividing line between the polarities and an approximative formula for the tilt angle. 

POLSP 

60-64 

F5.2 

Polarity separation of the group in degrees on the solar sphere. This and the following items are 
based on the polarity definition for TILTAN. 

FN 

66-67 

12 

Number of spots in the following polarity 

LN 

69-70 

12 

Number of spots in the leading polarity 

FAR 

72-74 

13 

Umbral area of the following polarity, in MSH 

LAR 

76-78 

13 

Umbral area of the leading polarity, in MSH 

FLL.L 

80-84 

F5.1 

Area-weighted longitude of the following polarity 

FBB.B 

86-90 

F5.1 

Area-weighted latitude of the following polarity 

LLL.L 

92-96 

F5.1 

Area-weighted longitude of the leading polarity 

LBB.B 

98-102 

F5.1 

Area-weighted latitude of the leading polarity 

GFC 

104-108 

F5.1 

Heliocentric distance of the group from the disk centre in degrees 
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